Handling Speech Repairs and Other Disruptions Through Parser Metarules

نویسندگان

  • Mark G. Core
  • Lenhart K. Schubert
چکیده

Mixed-initiative dialogs often contain interruptions in phrase structure such as repairs and backchannel responses. Phrase structure as traditionally defined does not accommodate such phenomena, so it is not surprising that phrase structure parsers are ill-equipped to handle them. This paper presents metarules that specify how phrase structure rules may be restarted or interrupted (including overlapping speech). In the case of overlapping speech or a backchannel response, the metarules allow a constituent to overlap or be embedded inside another constituent that it is unconnected to. In the case of repairs, the metarules operate on the reparandum (what is being repaired) and alteration (the correction) to build parallel phrase structure trees: one with the reparandum and one with the alteration. Consider the utterance, take the banum the oranges. The repair metarule would build two VPs, one being take the banand the other being take the oranges. The introduction of metarules simplifies the notion of an utterance since a sentence interrupted by an acknowledgment such as okay can still be one utterance formed around the interrupting acknowledgment. Together metarules and phrase structure rules specify the structures that should be accommodated by a parser for mixed initiative dialogs. A dialog parser should also maintain a dialog chart that stores the results of syntactic and semantic analysis of all of the dialog seen so far. This dialog chart will be a shared resource eliminating the need for maintenance of a separate representation of dialog structure by a dialog manager. In addition, the dialog parser can alert the dialog manager to utterances introducing obligations as well as recognizing acknowledgments and responses based on syntactic information.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Implementing Parser Metarules that Handle Speech Repairs and Other Disruptions

Mixed-initiative dialogs often contain interruptions in phrase structure such as repairs and backchannel responses. Phrase structure as traditionally de ned does not accommodate such phenomena, so it is not surprising that phrase structure parsers are ill-equipped to handle them. This paper presents metarules that specify how the instantiations of phrase structure rules may be restarted or inte...

متن کامل

A Syntactic Framework for Speech Repairs and Other Disruptions

This paper presents a grammatical and processing framework for handling the repairs, hesitations, and other interruptions in natural human dialog. The proposed framework has proved adequate for a collection of human-human task-oriented dialogs, both in a full manual examination of the corpus, and in tests with a parser capable of parsing some of that corpus. This parser can also correct a pre-p...

متن کامل

A Model of Speech Repairs and Other Disruptions

Most dialog systems ignore the problem of speech repairs and editing terms (urn, uh, etc.) or use preprocessing techniques to eliminate them from the input. These systems also typically enforce a strict turn-taking protocol that does not allow speakers to interrupt each other. This paper describes a parser that can process input containing editing terms, speech repairs, and second speaker inter...

متن کامل

Detecting and Correcting Speech Repairs

Interactive spoken dialog provides many new challenges for spoken language systems. One of the most critical is the prevalence of speech repairs. This paper presents an algorithm that detects and corrects speech repairs based on finding the repair pattern. The repair pattern is built by finding word matches and word replacements, and identifying fragments and editing terms. Rather than using a ...

متن کامل

Studying impressive parameters on the performance of Persian probabilistic context free grammar parser

In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002